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Abstract. We prove that orthogonal constructor term rewrite systems and A-calculus 
with weak (i.e., no reduction is allowed under the scope of a A-abstraction) call-by- value 
reduction can simulate each other with a linear overhead. In particular, weak call-by- 
value beta-reduction can be simulated by an orthogonal constructor term rewrite system 
in the same number of reduction steps. Conversely, each reduction in a term rewrite 
system can be simulated by a constant number of beta-reduction steps. This is relevant 
to implicit computational complexity, because the number of beta steps to normal form 
is polynomially related to the actual cost (that is, as performed on a Turing machine) of 
normalization, under weak call-by-value reduction. Orthogonal constructor term rewrite 
systems and A-calculus are thus both polynomially related to Turing machines, taking as 
notion of cost their natural parameters. 



Introduction 

Implicit computational complexity is a young research area, whose main aim is the descrip- 
tion of complexity phenomena based on language restrictions, and not on external measure 
conditions or on explicit machine models. It borrows techniques and results from math- 
ematical logic (model theory, recursion theory, and proof theory) and in doing so it has 
allowed the incorporation of aspects of computational complexity into areas such as formal 
methods in software development and programming language design. The most developed 
area of implicit computational complexity is probably the model theoretic one — finite 
model theory being a very successful way to describe complexity classes. In the design 
of programming language tools (e.g., type systems), however, syntactical techniques prove 
more useful. In the last years we have seen much work restricting recursive schemata and 
developing general proof theoretical techniques to enforce resource bounds on programs. 
Important achievements have been the characterizations of several complexity classes by 
means of limitations of recursive definitions (e.g., [BC92, Lei95 ) and, more recently, by 
using the "light" fragments of linear logic |Gir98] . Moreover, rewriting techniques such 
as recursive path orderings and the interpretation method have been proved useful in the 
field [MMOOJ. By borrowing the terminology from software design technology, we may dub 
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this area as implicit computational complexity in the large, aiming at a broad, global view 
on complexity classes. We may have also an implicit computational complexity in the small 
— using logic to study single machine-free models of computation. Indeed, many models of 
computations do not come with a natural cost model — a definition of cost which is both 
intrinsically rooted in the model of computation, and, at the same time, it is polynomi- 
ally related to the cost of implementing that model of computation on a standard Turing 
machine. The main example is the A-calculus: the most natural intrinsic parameter of a 
computation is its number of beta-reductions, but this very parameter bears no relation, in 
general, with the actual cost of performing that computation, since a beta-reduction may 
involve the duplication of arbitrarily big subterms^. What we call implicit computational 
complexity in the small, therefore, gives complexity significance to notions and results for 
computation models where such natural cost measures do not exist, or are not obvious. In 
particular, it looks for cost-explicit simulations between such computational models. 

The present paper applies this viewpoint to the relation between A-calculus and orthog- 
onal constructor term rewrite systems (OCRSs in the following). We will prove that these 
two computational models simulate each other with a linear overhead. That each OCRS 
could be simulated by A-terms and beta-reduction is well known, in view of the availability, 
in A-calculus, of fixed-point operators, which may be used to solve the mutual recursion 
expressed by first-order rewrite rules. Here (Section [3]) we make explicit the complexity 
content of this simulation, by showing that any first-order rewriting of n steps can be sim- 
ulated by kn beta steps, where k depends on the specific rewrite system but not on the 
size of the involved terms. Crucial to this result is the encoding of constructor terms using 
Scott's schema for numerals Wad80]. Indeed, Parigot [Par90] (see also [PR93J) shows that 
in the pure A-calculus Church numerals do not admit a predecessor working in a constant 
number of beta steps. Moreover, Splawski and Urzyczyn [SU99] show that it is unlikely 
that our encoding could work in the typed context of System F. 

Section [2] studies the converse - the simulation of (weak) A-calculus reduction by means 
of OCRSs. We give an encoding of A-terms into a (first-order) constructor term rewrite 
system. We write [•]$ for the map returning a first-order term, given a A-term; [M]$ is, 
in a sense, a complete defunctionalization of the A-term M, where any A-abstraction is 
represented by an atomic constructor. This is similar, although not technically the same, 
to the use of supercombinators (e.g., [Jon87| ) . We show that A-reduction is simulated step 
by step by first-order rewriting (Theorem 12. 9p . 

As a consequence, taking the number of beta steps as a cost model for weak A-calculus 
is equivalent (up to a linear function) to taking the number of rewritings in OCRSs systems. 
This is relevant to implicit computational complexity "in the small" , because the number of 
beta steps to normal form is polynomiaily related to the actual cost (that is, as performed 
on a Turing machine) of normalization, under weak call-by-value reduction. This has been 
established by Sands, Gustavsson, and Moran [SGM02], by a fine analysis of a A-calculus 
implementation based on a stack machine. OCRSs and A-calculus are thus both reasonable 
machines (see the "invariance thesis" in [vEB90]), taking as notion of cost their natural, 
intrinsic parameters. 

1 In full beta-reduction, the size of the duplicated term is indeed arbitrary and does not depend on the 
size of the original term the reduction started from. The situation is much different with weak reduction, as 
we will see. 
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As a byproduct, in Section |4] we sketch a different proof of the cited result in [SG M02] , 
Instead of using a stack machine, we show how we could implement constructor term rewrit- 
ing via term graph rewriting. In term graph rewriting we avoid the explicit duplication and 
substitution inherent to rewriting (and thus also to beta-reduction) and, moreover, we 
exploit the possible sharing of subterms. A more in-depth study of the complexity of (con- 
structor) graph rewriting and its relations with (constructor) term rewriting can be found 
in another paper by the authors [DLM10J. 

In Section [5j we show how to obtain the same results of the previous sections when 
call-by-name replaces call-by- value as the underlying strategy in the A-calculus. 

1. Preliminaries 

The language we study is the pure untyped A-calculus endowed with weak (that is, we never 
reduce under an abstraction) call-by-value reduction. 

Definition 1.1. The following definitions are standard: 

• Terms are defined as follows: 

M ::= x | Xx.M \ MM, 

where x ranges a denumerable set T. A denotes the set of all A-terms. We assume 
the existence of a fixed, total, order on T; this way FV(M) will be a sequence (without 
repetitions) of variables, not a set. A term M is said to be closed if FV(M) = e, where 
e is the empty sequence. 

• Values are defined as follows: 

V ::= x | Xx.M. 

• Weak call-by-value reduction is denoted by — > v and is obtained by closing call-by-value 
reduction under any applicative context: 

M -ty N M — »u N 

(Xx.M)V -> v M{V/x} ML -> v NL LM -> v LN 

Here M, N, L range over terms, while V ranges over values. 

• The length \M\ of M is defined as follows, by induction on M: \x\ = 1, |Ax.M| = \M\ + 1 
and |MJV| = \M\ + \N\ + 1. 

Weak call-by-value reduction enjoys many nice properties. In particular, the one-step dia- 
mond property holds and, as a consequence, the number of beta steps to normal form (if 
any) is invariant on the reduction order [DLM08] (this justifies the way we defined reduction, 
which is slightly more general than Plotkin's one |Plo75| ). It is then meaningful to define 
Time v (M) as the number of beta steps to normal form (or uj if such a normal form does not 
exist). This cost model will be referred to as the unitary cost model, since each beta (weak 
call-by-value) reduction step counts for 1 in the global cost of normalization. Moreover, 
notice that a-conversion is not needed during reduction of closed terms: if M — > v N and 
M is closed, then the reduced redex will be in the form (Xx.L)V, where V is a closed value. 
As a consequence, arguments are always closed and open variables cannot be captured. 
Suppose M has n free variables x\ < . . . < x n , and that Aq, . . . , N n are lambda-terms. 
The term M{Ni/x±, . . . , N n /x n } is sometimes denoted simply with M(Aq, . . . , N n ), taking 
advantage of the implicit order between the variables. 
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The following lemma gives us a generalization of the fixed-point (call-by-value) com- 
binator (but observe the explicit limit k on the reduction length, in the spirit of implicit 
computational complexity in the small): 

Lemma 1.2 (call-by-value fixpoint combinator). For every natural number n, there are 
terms H±, . . . , H n and a natural number m such that for any sequence of values V±, . . . ,V n 
and for any 1 < i < n: 

HiVy ...V n ^ k v Vi{\x.H x Vi ...V n x)... (Xx.HnVy . . . V n x), 

where k < m. 

Proof. The terms we are looking for are simply the following: 

H l = M i M 1 ...M n 

where, for every 1 < j < n, 

Mj = Axi Ax n .Ayi y n .yj{\z.xixi . . . x n y x . . . y n z) . . . (\z.x n x\ . . . x n yi . . . y n z). 

The natural number m is simply 2n. □ 

We only consider orthogonal and constructor rewriting in this paper. A constructor 
term rewrite system is a pair S = (S=,7^s) where: 

• Symbols in the signature £~ can be either constructors or function symbols, each with 
its arity. 

• Terms in C(H) are those built from constructors and are called constructor terms. 

■ Terms in V(S, T) are those built from constructors and variables and are called 
patterns. 

■ Terms in T(H) are those built from constructor and function symbols and are called 
closed terms. 

■ Terms in V(H, T) are those built from constructors, functions symbols and variables 
in T and are dubbed terms. 

• Rules in 7Zs are in the form f(si, . . . , s n ) — >z t where f is a function symbol, s\, . . . ,s n € 
■p(H, T) and t £ V(H,T). We here consider orthogonal rewrite systems only, i.e. we 
assume that no distinct two rules in are overlapping and that every variable appears 
at most once in the lhs of any rule in IZ5. Moreover, we assume that reduction is call- 
by-value, i.e. the substitution triggering any reduction must assign constructor terms to 
variables. This restriction is anyway natural in constructor rewriting. 

For any term t in a OCRS, \t\ denotes the number of symbol occurrences, while |i|f denotes 
the number of occurrences of the symbol f in t. Similarly to A-terms, if t contains instances 
of n variables x\ < ... < x n , the term t{u±/xi, . . . ,u n /x n } is sometimes denoted simply 
with t(ui, . . . , u n ). 

2. From A-Calculus to Constructor Term Rewriting 

In this section, we will prove that the A-calculus, in the form introduced in Section [U can 
be seen as a OCRS. This result will be spelled out as follows. 

• An OCRS $ on a signature will be defined, together with two maps [•]$ : A — > 
V(E$,T) and (-)a : V(S$,T) — > A. These two maps are not bijections. However, [•]$ is 
injective, and (-)a o [■]$ is the identity. 

• The concept of canonicity for terms in V(<3?,T) will be defined. Moreover, the set of 
canonical terms will be shown to include [A]$ and to be closed by reduction. 
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• Reduction of canonical terms will be shown to simulate weak call-by-value reduction on 
A-terms, via (-)a- Conversely, the dynamics of A-terms is proved to simulate rewriting 
of constructor terms again through (-)a- 

Altogether, the three ingredients above implies that $ is a sound and complete way of 
implementing call-by-value /3-reduction. 

Let us start by defining $ and the two functions allowing to translate terms <I> into 
A-terms and, conversely, A-terms back into terms of <3?. Canonicity can already be defined. 

Definition 2.1 (The OCRS <I>, Canonicity). The OCRS $ is defined as a set of rules H<$> 
over an infinite signature S$. In particular: 

• The signature includes the binary function symbol app and constructor symbols 
c x,M for every M G A and every x G T. The arity of c X: m is the length of FV(Ax.M). 
To every term M G A we can associate a term [M]$ G V(<&, T) as follows: 

[x]$ = x; 
[Xx.M] 

<& — c x,m(xi, . . . , x n ), where FV(Ax.M) — x\, x n ; 

[MN}$ = app([M]*,[JV]*). 

Observe that if M is closed, then [M]$ G T($). 

• The rewrite rules in TZq, are all the rules in the following form: 

app(c Xi M(xi, . . . ,x n ),x) [M]$, 

where FV(Ax.M) = x\, . . . , x n . 

• To every term t G V(<I>, T) we can associate a term (t) A G A as follows: 

(x) A = x 
{app(u,v)) A = {u)a{v) a 
{c x ,M(h, ■ ■ -t n )) A = (Xx.M){(t 1 ) A /x 1 ,. . . ,(t n ) A /x n } 

where FV(Ax.M) = x±, . . . , x n . 

• A term t G T(3>) is canonical if either t G C($) or i = app(u, u) where u and w are 
themselves canonical. 

Notice that the signature contains an infinite number of constructors. 

Example 2.2. Consider the A-term M = (Xx.xx)(Xy.yy). [M]$ is t = &pp(c x xx , 
Moreover, t — > app(c ?/iOT , c ytyy ) = u, as expected. We have u — > u. Both t and u are 
canonical. Finally, (u) A = (Xy.yy)(Xy.yy). 

The map [•]$ is injective, but not surjective. However: 

Lemma 2.3. For every X-term M G A, ([M]$)a = M . 

Proof. By induction on M: 

• If M = x, then 

([M]*)a = (N*)a = (sc)a = a;- 

• If M = NL, then 

([M] $ ) A = (a PP ([A] $ , [L]*)) A = <[JV]*)a([L]*)a = JVL. 

• If M = Ay. A, then 

([M]$) A = (cy^ix!, x n )) A = (Xy.N){x 1 /x 1 , x n /x n } = Xy.N = M. 
This concludes the proof. □ 
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Canonicity holds for terms in $ obtained as images of (closed) A-terms via [•]$. More- 
over, canonicity is preserved by reduction in 

Lemma 2.4. For every closed Me A, [M]$ is canonical. Moreover, if t is canonical and 
t — > u, then u is canonical. 

Proof. [M]$ is canonical for any M € A by induction on the structure of M (which, by 
hypothesis, is either an abstraction or an application NL where both N and L are closed). 
We can further prove that v = [M\${t\/xi, . . . t n /x n } is canonical whenever t\, . . . ,t n £ 
£(<£) and xi,...,x n includes all the variables in FV(M): 

• If M = Xi, then v = U, which is clearly canonical. 

• If M = NL, then 

v = [NL]< s ,{t 1 /x 1 ,...t n /x n } 

= app ([N}${t 1 /x 1 , . . . t n /x n }, [L]<s>{ti/x u . . . t n /x n }) 

which is canonical, by IH. 

• If M = Xy.N, then 

v = [\y.N]${t 1 /x 1 ,...t n /x n } 

= £y,N(.%i\ -i ■ ■ ■ i ■£i m ){^l/'£l' • • • ^n/ %n} 
= c y,N (th :■■■■> U m ) 

which is canonical, because each t{ (and hence also v) is in C(&). 
This implies the rhs of any instance of a rule in 1Z§ is canonical. As a consequence, u is 
canonical whenever t — > u and t is canonical. This concludes the proof. □ 

For canonical terms, being a normal form is equivalent to being mapped to a normal 
form via (-)a- This is not true, in general: take as a counterexample c Xjy (app(c Zj 2, c ZfZ )), 
which corresponds to Xx.(Xz.z)(Xz.z) via (-)a- 

Lemma 2.5. A canonical term t is a normal form iff (t)\ is a normal form. 

Proof. If a canonical t is a normal form, then t does not contain the function symbol app 
and, as a consequence, (t)\ is an abstraction, which is always a normal form. Conversely, 
if (<)a is a normal form, then t is not in the form app(n, v), because otherwise (£)a will be 
a (closed) application, which cannot be a normal form. But since t is canonical, t € C($), 
which only contains terms in normal form. □ 

The following substitution lemma will be useful later. 

Lemma 2.6 (Substitution). For every term t € V(3>, T) and every t±, . . . , t n G C($>) ; 

(t{ti/xi, . . . , t n /x n }) A = {t)A{{ti}\/xi, {t n ) k /x n } 

whenever x±, . . . , x n includes all the variables in t. 

Proof. By induction on t: 

• If t = Xi, then 

(tih/xi, . . . ,t n /x n }}A = (xi{ti/x 1 ,...,t n /x n }) A 

= {U)a 

= x i {(t 1 ) A /x 1 ,...,{t n ) A /x n } 

= t{(tl)\/xi, . . . , {t n ) A /x n }- 
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• If t = app(ii, v), then 

{tih/xi, . . . ,t n /x n }) A = (app(u,v){t 1 /xi, . . . ,t n /x n })\ 

= (app(ii{ti/zi, . . . ,t n /x n },v{h/xi, . . . ,t n /x n }))\ 

= {u{t 1 /x 1 , . . . ,t n /x n })A{v{h/xi, . . . ,t n /x n }} A 

= («)a{(*i)a/jC1, ■ ■ ■ , (tn)\/x n }(v)\{(ti)\/xi, . . . , {t n )A/x n } 
= {u)A{v) A {{tl)A/xi, {t n )A/x n } 

= (app(u, v)) A {{h)A/xi, ... , (t n )A/x n } 
= {t}A{(h)\/xi,. . . ,{t n ) A /x n }. 

• If t = Cy tN (m, . . . ,u m ), then 

(t{h/xi, . . . ,t n /x n })A = {Cy,N(ui,...,U m ){ti/xx,...,t n /x n })A 

= (c y ,N(ui{ti/xi, ■ ■ ■ ,t n /x n }, . . . ,u m {ti/xi, . . . ,t n /x n }))A 
= (Xy.N){(ui{ti/xi, . . . , tn/Xn^A/x^ 

? * * * 5 

{u m {h/xi, . . . ,t n /x n })A/x im } 
= (\y.N){(ui) A {{ti)A/xi, ■■■ , {tn)h/x n }/x il 

J • • • ) 

{Um)A{(h)\/xi, . . . , {t n )A/x n }/x im } 

= ((Xy.N){(ui)A/xi, . . . ,u m /xi 1 }){(t 1 } A /x 1 , (t n } A /x n } 
= (Cy t N(lii, . . . ,U m ))A{{ti)A/xi, ... , {t n )A/x n } 
= (*}a{(*i)a/»1 i • • • j (tn)A/x n }. 

This concludes the proof. □ 

Two of the previous lemmas imply that if M G A, t±,. . . ,t n S C($) and x\, . . . ,x n 
includes all the variables in FV(M), then: 

([M] 9 {ti/ Xl , t n /x n })A = M{(ti) A /zi, • • • , {t n )A/x n }. (2.1) 

Reduction in $ can be simulated by reduction in the A-calculus, provided the starting term 
is canonical. 

Lemma 2.7. If t is canonical and t —¥ u, then if) a (u)a- 
Proof. Consider the (instance of the) rewrite rule which turns t into u. Let it be 
app(c 2/iM (ii, • • • ,t n ),v) -> [M]${ti/xi, . . . ,t n /x n ,v/y}. 

Clearly, 

{app(cy tM (ti, ■ ■ -,t n ),v))A = ((Xy.M){(t 1 )A/xi, {t n )A/x n }){v) A 
while, by ([2TTD : 

([M]${ti/xi, . . . ,t n /x n ,v/y})A = M{{tx)\/x\, {t n )A/x n , {v) A /y} 
which implies the thesis. □ 
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Conversely, call-by-value reduction in the A-calculus can be simulated in <J>: 

Lemma 2.8. If M —¥ v N , t is canonical and (£)a = M, then t — >■ u, where (u)a = N. 

Proof. Let (Xx.L)V be the redex fired in M when rewriting it to N. There must be a 
corresponding subterm v of t such that {v)\ = (Xx.L)V. Then 

v = app(c Xi p(ti, . . .,t n ),w), 

where (c Xi p(ii, . . . , t n )}\ = Xx.L. and (w)a = V. Observe that, by definition, 

(c x ,p(h, ■ ■ ■ ,t n ))A = (Xx.P){(t 1 ) A /x 1 , . . . , {t n )x/x n } 

where FV(P) = x\, . . . ,x n . Since t is canonical, t\, . . . ,t n G C($). Moreover, since V is a 
value, w itself is in C($). This implies 

appfc-^pOi, . . .,t n ),w) -4 [P]${ti/xi, . . . ,t n /x n ,w/x}. 

By ([23]): 

([P]${ti/a? 1 ,...,t fl /a; fl ,io/a:}}A = P{(£i)a/zi, • • • , {t n )k/x n , (w)\/x} 

= (P{(h)\/xi, . . . , it n ) K /x n }){{w)x/x} 
= {Xx.L){V/x}. 

This concludes the proof. □ 

The previous lemmas together imply the following theorem, by which A-calculus nor- 
malization can be mimicked (step-by-step) by reduction in <£: 

Theorem 2.9 (Term Reducibility) . Let M G A be a closed term. The following two 

conditions are equivalent: 

1. M — >™ N where N is in normal form; 

1. [M]<j> — > n t where (t)\ = N and t is in normal form. 

Proof. Suppose M — N, where N is in normal form. Then, by applying Lemma 12.81 we 
obtain a term t such that [M]$ — )- n £ and (t)\ = N. By Lemma |2.4| t is canonical and, 
by Lemma 12.51 it is in normal form. Now, suppose [M]$ — > n t where (t}\ = N and t is 
in normal form. By applying n times Lemma 12.71 we obtain ([M]$)a — (t)\ = N. But 
([M]$)a = M by Lemma 12.31 and N is a normal form by Lemma 12.51 since [M]$ and £ are 
canonical by Lemma 12.41 □ 

There is another nice property of that will be crucial in proving the main result of 
this paper: 

Proposition 1 (Subterm Property). For every M G A, for every t with [M]$ — y* t and for 
every occurrence of a constructor c X) at in t, N is a subterm of M. 

Proof. Assume — > n t and proceed by induction on n. □ 
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Example 2.10. Let us consider the A-term M = (Xx.(Xy.x)x)(Xz.z). Notice that 

M —> v (Xy.(Xz.z))(Xz.z) — >■„ Xz.z. 

Clearly [M]$ = app( 

c x,(\y.x)xi c z,z)- Moreover: 

app(cx,(\y.x)x, c z,z) -> app(cy }X (c z>z ),c z>z ) -» C ZjZ . 

For every constructor c W) jv occurring in any term in the previous reduction sequence, N is 
a subterm of M . 

A remark on $ is now in order. $ is an infinite OCRS, since E<j> contains an infinite 
amount of constructor symbols and, moreover, there are infinitely many rules in 7Z$. As 
a consequence, what we have presented here is an embedding of the (weak, call-by-value) 
A-calculus into an infinite OCRS. Consider, now, the following scenario: suppose the A- 
calculus is used to write a program M, and suppose that inputs to M form an infinite set 
of A-terms O which can anyway be represented by a finite set of constructors in <5. In this 
scenario, Proposition Q] allows to conclude the existence of finite subsets of E$ and TZ$ such 
that every MN (where N £ Q) can be reduced via by using only constructors and rules 
in those finite subsets. As a consequence, we can see the above schema as one that puts any 
program M in correspondence to a finite OCRS. Finally, observe that assuming data to be 
representable by a finite number of constructors in <3? is reasonable. Scott's scheme [Wad80j, 
for example, allows to represent any term in a given free algebra in a finitary way, e.g. the 
natural number becomes [0] = c y \z.z while n + 1 becomes \n + 1] = c y \ zyx (\n\). 
Church's scheme, on the other hand, does not have this property. 

2.1. An Example. Consider the lambda terms M = Xx.Xy.xyx and N = Xx.Xy.yxy. It is 
easy to verify that: 

L = (MN)M -> 2 V (NM)N, 
P = (NM)N -> 2 V (MN)M. 
Therefore, both L and P diverge. Now: 

[L]$ = app(app(c rriA?/ .xyx-i Cx,\y.yxy)i ^x,\y.xyx) 
y t = &Pp(Cy t xyx(.Cx,\y .yxy)i ^x ,Xy .xyx) 
^ &PP(<*PP{Cx, Xy.yxy j ^x ,Xy .xyx)i ^x,\y.yxy) 
EE 

Similarly, [P]$ — > 2 Observe that along the computation we reach the term t, which is 

not the image of any A-term. However, all constructor terms in the reduction are canonical 
and, moreover, (t)\ is (Xy.NyN)M, the lambda term found along the reduction from L to 
P. 



3. From Constructor Term Rewriting to the A-Calculus 

In this section, we will show that one rewriting step of any constructor rewrite system can be 
simulated by a fixed number of weak call-by-value beta-reductions. As an easy consequence, 
A-calculus will be shown to efficiently simulate any OCRS. During this section we will assume 
fixed an OCRS S over a finite signature Eg. Let ci, . . . ,c g be the constructors of S and let 
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fi, . . . , ih be the function symbols of E. We will describe several constructions, which work 
independently of E (they only depends on the arity of the symbols). 

• A map ((-))a : — > A can be defined by recursion on the structure of the input. The 
map can be extended to constructors of S (which are not terms by themselves), in such 
a way that for every Cj, the lambda term ((ci))a "computes" ((cj(ti . . .i a r(ci))))A when 
fed with ((h)) a . . . ((i or (ci)))A, for any t x , . . . ,t ar ( Ci ) G C(S). (See Definition |3]TJ) 

• Defining a map analogous to ((-))a, but acting on closed terms (and not only on con- 
structor terms) is more delicate. Indeed, a term fi(ti . . . t ar r Ci \) does not necessarily 
rewrite to a constructor term, even if it does not diverge — the rewrite rules of E are 
not necessarily exhaustive and a deadlock can be reached. To handle this case we define 
a lambda term _L € A, which will represent any deadlocked term. 

• Now a map [-]a : 7"(E) — > A can be defined, in such a way that [£]a reduces to ((it)) a 
(where u € C(S)) if t has normal form u, but [£]a reduces to _L if t rewrites to a deadlock. 
The map [-]a is defined compositionally, that is to say: 

[c(ti, . . . ,t ar ( Ci ))]A = [Ci]a[*i]a • • • [tar( Ci )]A; 

[fj(tl, . . . ,t ar ( fi ))]A = [fi]A[*l]A • • • [*ar(fj)]A- 

In other words, [-]a is completely specified by its behavior on constructors and function 
symbols. 

• While defining [c]a is relatively easy (Definition 13.21 and Lemma [3. 3p . [f]A requires a form 
of pattern matching to be implemented in the A-calculus (Lemma 13 . 41 and Definition [33]). 

• The complete simulation is stated in Theorem 13.61 The example in 13.11 may be used 
along the section to clarify the definitions. 

We will first concentrate on constructor terms, encoding them as A-terms using Scott's 
schema [Wad80| . 

Definition 3.1. • Constructor terms can be easily put in correspondence with A-terms 
by way of a map ((-))a defined by induction as follows: 

((cj(ii . . . ,t„))) A = Axi Xx g .Xy.Xi((ti)) A . . . ((* r ))a- 

• The function ((-))a can be extended to a map on constructors: 

{{ci))\ = Xxi Ax ar(Ci) .Ayi Xy g .Xz.yiXi . . . x ar{c% y 

Trivially, if ti,...,t n are in C($), ((cj)) A ((ti))A • • • ((*h))a rewrites to ((cj(ti . . . t n ))) A in 
ar(cj) steps. 

• To represent an error value, we use the A-term _L = Aa^i Xx g .Xy.y. A A-term which 

is either _L or in the form ((t))\ is denoted with metavariables like X or Y . 

The map ((-))a defines encodings of constructor terms. For function symbols our goal is 
defining another map [-]a returning a A-term given any term t in T(E), in such a way that 
t — u and u € C(S) implies \t] A ((it)) A- Moreover, [t]\ should rewrite to _L whenever 
the rewriting of t causes an error (i.e. t has a normal form containing a function symbol). 
First of all, we define the A-term [cj]a corresponding to a constructor Cj. 

Definition 3.2. • For every 1 < i < g, for every < m < ar(cj), and for every sequence 
of variables x\, ■ ■ ■ ,x m , define the A-term CON l xl ^ Xm by induction on ar(cj) — m: 

CON Xx> ^ Xar(ci) = Xyi Xy g .z.y iXl . . . x ar{Ci y, 

Vm:0<m<ar( Cl ) COX^ ,,. = Xy.yNft . . . N™L?- 



ON CONSTRUCTOR REWRITE SYSTEMS AND THE LAMBDA-CALCULUS 



11 



where: 

N Ti = ^ Xz ar(Cj) .(Xx m+1 .CONi u , M )CON{ u ...^ 
L™ = \z m+ 2 \z ar ( c .y±. 

• For every 1 < i < g, the A-term [ci] A is CON' l £ . 

We need to prove that [cj]a does what it is supposed to do. We show something slightly 
stronger: 

Lemma 3.3. There is a constant n G N such that for any i, for any m, and for any 
«*i»A, ■ ■ ■ , ((*ar(ci)»A inC(E): 

CON Xu ^ Xm {((t 1 )) A /x 1 , ((t m )) A /x m }((t m+1 )) A . . . ((t ar{Ci) )) A ^ k ((Ciih . . . t ar[Ci) ))) A 

where k < n, and 

CON i Xl ^ Xm {{{t 1 )) A /x 1 , ((t m )) A /x m }X m+1 . . . X ar(ci) -V ± 

where I < n, whenever Xj is either ((tj)) A or _L but at least one among X m+ i . . . X ar r c .\ is 
1_. 

Proof. We proceed by induction on ar(cj) — m: 

• If m = ar(cj), then 

c ' 0iV l ! i ) ...,* < „.( C . ) {«*i»A/a:i, • • • , ((iar(c,)))A/a: ar ( Ci )} 

= (Aj/l \VgViX\ . . . Xor( Ci )){((*l))A/a;i, • • • , ((*ar(ci)))A/z or ( Ci )} 

= Xyi Xy g .yi{{ti)) A . . . ((i ar ( Ci )))A 

= ((Cj(ti, . . . ,t ar ( c .)))) A . 

• If m < ar(cj), we use the following abbreviations: 

P% = N™ i {((h))A/x 1 ,...,((t m )) A /x m }; 
Qf = Lf{((t 1 )) A /x 1 ,...,((t m ))A/x m }. 

Let's distinguish two cases: 
• If X m+ \ = _L, then: 

CON Xi _ Xm {((t 1 )) A /x l , {{t m )) A /x m }X m+1 . . . X ar(Ci) 

->•„ (J-P{^ . . . P™Q™)^m+2 • • • X ar( Ci ) 

— s>* Q^X m+ 2 . . . X ar ( c ^ 
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• Let X rn+ i be ((t m+ i)} A , where t m +\ = Cj(m, . . . ,u OT .( Cj .)). Then: 
CONi u ^{{{t^A/x!, {{t m )) A /x m }X m+1 . . . X ar(c%) 

~>v ({{ c j( u li • • • ' u ar{ Cj ))))KP™i ■ ■ ■ P™iQT) X m+2 ■ ■ ■ X ar(a) 
~>v P j!A u l))^ ■ ■ ■ (( u ar( Cj )}}A X m+2 ■ ■ ■ X ar(a) 

->% (Ax ro +i.C0iVi lv .. )a . tn+1 {((ti»A/xi, • • • , ({t m )) A /x m }) 

(CON J Zl ,..., Zar(Cj) {{{ui))A/yi, • • • , ((t ar{C]) )) A /y ar{C]) })X m+2 . . . X ar(Ci) 
->* (Ax m +i.COiVi 1) ... ia . tn+1 {((ti»A/ari, • • • , {{t m ))A/x m }) 

(((Cj(«i, . . . , U ar ( Cj )))) A )^m+2 • ■ ■-X'or(ci) 

CON l xu ... )Xm+1 {((tl)) A /zi, . . . , ({t m+ i)}A/x m+ i}X m+2 . . .X or ( c .) 

and, by the inductive hypothesis, the last term in the reduction sequence reduces to 
the correct normal form. The existence of a natural number n with the prescribed 
properties is clear by observing that none of the reductions above have a length which 
depends on the parameters ((*i}}a, ■ ■ ■ , ((t m ))A and X m+l . . . X ar ( c .y 
This concludes the proof. □ 

Interpreting function symbols is more difficult, since we have to "embed" the reduction 
rules into the A-term interpreting the function symbol. To do that, we need a preliminary re- 
sult to encode pattern matching. More specifically, suppose a±, . . . ,a n are non-overlapping 
sequences of patterns of the same length m, i.e. that for every sequence of constructor 
terms ti, . . . ,t m there is at most one i with 1 < i < m such that ti,...,t m unifies with 
the patterns in ol{. Then, we need to build a A-term PAT 7 £ 1 Qn which, when fed with m 
(encodings of) constructor terms and n values, perform pattern matching and select the 
"right" value, or returns _L if none of a±, . . . , a n unifies with the constructor terms in input. 

Lemma 3.4 (Pattern matching). Let «i, . . . ,a n be non- overlapping sequences of patterns 
of the same length m. Then there are a term PAT^ an and an integer I such that for 
every sequence of values V\, . . . , V n , if ai = si, . . . , s m then 

PATZ^aJs^tl, . . . ,^)» A . . . « Sm (4, . . . ,t*T)»AVi ...V n 

^ k v wi>>A...«^>>A...toA...«t5r>>A > 

where k < I, whenever the t\ are constructor terms. Moreover, 

PAT™ 1 ___ cen X 1 , . . . ,X m Vi ...V n -^y±, 

where k < I, whenever X±, . . . , X m do not unify with any of the sequences cx\,...,a n or any 
of the Xi,..., X m is itself J- . 

Proof. We go by induction on a = Y17=i ll a «ll> wnere ll a «ll is the number of constructors 
occurrences in patterns inside af. 

• If a = and n = 0, then we should always return _L: 

PAT™ = Axi Ax m ._L. 

• If a = and n > 0, then n = 1 and cti is simply a sequence of variables x\, . . . ,x m , 
because the cnj are assumed to be non-overlapping. Then PAT^ Xm is a term defined 
by induction on m which returns _L only if one of its first m arguments is _L and otherwise 
returns its m + 1-th argument applied to its first m arguments. 
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• If a > 1, then there must be integers i and j with 1 < i < m and 1 < j < n such that 
cxj = s\, . . . , Si— i, C£.(ri, . . . , r ar ( Cfe )), Sj+i, . . . , s m 

for a constructor and for some patterns s p and some r q . Now, for every 1 < p < n and 
for every 1 < j < g we define sequences of patterns (3p and values Wp as follows: 
• If 

Op = S\, . . . , , Cj (Qi, . . . , Qar(cj) \ Sj+1 ■ ■ ■ &r. 



then /3p is defined to be the sequence 



Si, . . . , Si-i, qi, ... , q a r(cj)i ■ ■ ■ > s 



Moreover, W p is simply the indentity Xx.x. 

• If 

q p = si, . . . , Sj_i, c s (qi, . . . , g , ar ( Cs )), Sj+i . . . s 

where s ^ j then (3p and Wp are both undefined. 

• Finally, if 

CXp — S\ , . . . , 5j— 1, X, Sj+l . . . S m 

then /3p is defined to be the sequence 



in 



s li ■ ■ ■ : s i— 1) %li ■ ■ ■ ) 2-ar(cj); ^i+1) ■ ■ ■ j 



Sr. 



and Wp is the following A-term 

Xx.Xy 1 \yt-xi Ax ar ( Cfe) .Azi \z u .xyi . . . y t ({{c j )) A x 1 . . . x ar ^)z 1 . . . z u 

where t is the number of variables in si, . . . , Sj_i and u is the number of variables in 

Sj+l j • • • j s m- 

As a consequence, for every 1 < j < g, we can find a natural number tj and a sequence 
of pairwise distinct natural numbers ii,...,«t 3 - such that /3f ,...,/3f are exactly the 

sequences which can be defined by the above construction. We are now able to formally 
define PAT™ „ ; it is the term 

Aa;i \x m .\y\ Xy n .((xiZi . . . Z g Z ± )xi . . . Xi-ix i+i . . . x m )y\ ...y n 

where 

Vl<j<g-Zj = Xz 1 \z ar ( c .y\xi Axj-i.Axj+i Ax m .Ayi \y n . 

p AT m-\+ar{ Cj ) ^ Xi _ lZl . . . z ar(C]) X i+1 . . . x m {W^y h ) . . . {W ] it y k ) 

Z± = Xxi \xi_i.\x i+ i \x m .\yi Ay n ._L 

Notice that, for every j, a > Y^v=i WPv I I- Moreover, for every j any (3'i has the same 
length m — 1 + ar(cj). This justifies the application of the induction hypothesis above. 
Informally, PAT m lt an first do some case analysis based on the shape of its i-th argu- 
ment. Based on the topmost constructor in it, one between Zi,..., Zh, Z± is selected 

'YYi, ~\_ j (XT' f C " ) 

which itself do the rest of the pattern matching by way of PAT • . 
This concludes the proof. □ 
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Once a general form of pattern matching is available in the A calculus, we may define 
the A-term [fj]A interpreting the function symbol fj. 

Definition 3.5. For every function symbol fj, let 

be the rules for f«. Moreover, suppose that the variables appearing in the patterns in a\ are 

zf' 1 , . . . , zf™ 11 " 3 . Observe that the sequences a],..., a™* all have the same length m. Recall 
that we have a signature with function symbols fi, . . . ,fh- For any 1 < i < h the A-term 
[fj]A interpreting fj is defined to be: 

H l V l ...V h 

where 

Vi = Xx! Xx h .\ yi Ay ar(fi) .PA7^i ) ___ )Q „yi...y ar(fi) Wi 1 ...Wi ni ; 

wi = x Zl Xz mi ..i\t\U; 

whenever 1 < i < h and 1 < j < rii and \ ■ \\ is defined by induction as follows: 

\x\ik = x; 

(|Ci(tl, • • • ,t ar (ci))\)A = Na^iDa • • • d*ar( Ci )l)A; 
^i(h, ■ ■ ■ ,t a r(fi))H = Xi(\ti\) A . . . dt ar ( fi )DA- 

We have now implicitly defined how the map [-]a behaves on any term in V(H, T): 

[x]\ = x; 

[c(ti, . . . ,t or ( Cj ))]A = [Ci]a[*i]a • • • [tar(ci)W, 
[fj(tl, . . . ,t ar ( f .))]A = [fj]A[il]A • • • [t a r(ti)]A- 

Theorem 3.6. There is a natural number k such that for every function symbol f and 
for every t\, . . . , t ar m £ C(H), the following three implications hold, where u stands for 
f(*i, • • ■ ,*or(f)) and M stands for [f]A((ii))A • • • ((* a r(f)»A- 

• If u rewrites to v € C(H) in n steps, then M rewrites to {{v}}^ in at most kn steps. 

• If u rewrites to a normal form v ^ C(S), then M rewrites to _L. 

• If u diverges, then M diverges. 

Proof. By an easy combinatorial argument following from the definition of [-]a- Actually, a 
slightly stronger statement should be proved to make the proof formal: there is a natural 
number k such that for every u G V(S, T), for every ti,...,t m G C(H) (where m is the 
number of distinct variables in u), the following three implications hold, where M stands 
for [u]a- 

• If u(ti, . . . ,t m ) rewrites to v £ in n steps, then M(((ti))\, . . . , ({t m ))\) rewrites to 
{{v))a in at most kn\u\ steps. 

• If u(ii, . . . ,t m ) rewrites to a normal form v C(E), then M({(ti))\, . . . , ((t m ))A) rewrites 
to _L. 

• If u(ti, ...,t m ) diverges, then M(((ti)) A , . . . , {{t m )) A ) diverges. 

The first statement can be proved by induction on n. The second and third one are quite 
easy. □ 
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Clearly, the constant k in Theorem 13.61 depends on 3, but is independent on the par- 
ticular term u. 

3.1. An Example. In this section, we will describe the encoding of a concrete OCRS called 
AT>T> as a set of A-terms. The signature S^dd contains two constructor symbols and s, 
with arity and 1 (respectively), and a single function symbol add of arity 2. The only 
two rules in TZaw are the following: 

add(0, x) — > x; 

add(s(x),y) — > s(add(x), y). 

Let us construct first some A-terms in the image of ((}}a : 

!_ = Xx.Xy.Xz.z; 

((0))a = Xx.Xy.Xz.x; 

«s(0)»a = Xx.Xy.Xz.y((0)) A ; 

«s(s(0))»a = Xx.Xy.Xz.y((s(0))) A ; 

((s))a = Xw.Xx.Xy.Xz.yw. 

We now take a look at [s]a- By definition: 

[s] A = GONl = Xy.yNl 2 Nl 2 Ll 

^Xy.ydXxi.CONljCONDiXzUXxi.CONljiCONl))!. 

This A-term indeed "simulates" the successor constructor, when fed with an input. Suppose 
u G C(ADV), then: 

[s]a((0))a ^ 4 (Xx^CONl^CONl = (A^.CCW^X^a 

««(0)»a; 

[s]a«b(«)»a ^ 4 (Xzx.iXxi.CONljiCONlMu^A (A^Azi.CGW^Hsfa^A 

-> «b(b(«))»a; 

[s]a-L ^ 4 J-. 

Finally, consider add, the only function symbol of X.4x>£>. By definition: 

[add] A = H X V X ee £ri(Ax 1 .Ayi.Aite.Pi4r (0>!B) ,( 8 ( a! ), 1 , ) i/ 1 y2W 1 1 Wi 2 ) 
It is easy to verify that, by Lemma |3.4| 

[add] A ±((t»A ^* -L; 
[add] A ((s(0))) A ((s(s(0)))) A ^* «B(8(B(0)))» A . 
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4. Graph Representation 

The previous two sections proved the main simulation result of the paper. To complete the 
picture, we show in this section that the unitary cost model for the (weak call-by-value) 
A-calculus (and hence the number of rewriting in a OCRSs) is polynomially related to 
the actual cost of implementing those reduction^. We do so by introducing term graph 
rewriting, following [BE G + 86j but adapting the framework to call-by-value constructor 
rewriting. Contrarily to what we did in Section we will stay abstract here: our attention 
will not be restricted to the particular graph rewrite system that is needed to implement 
reduction in the A-calculus. 

We refer the reader to our [DLM10J for more details on efficient simulations between 
term graph rewriting and constructor term rewriting, both under innermost (i.e., call- by- 
value) and outermost (i.e., call-by-name) reduction strategies. 

Definition 4.1 (Labelled Graph). Given a signature S, a labelled graph over £ consists 
of a directed acyclic graph together with an ordering on the outgoing edges of each node 
and a (partial) labelling of nodes with symbols from £ such that the out-degree of each 
node matches the arity of the corresponding symbols (and is if the labelling is undefined). 
Formally, a labelled graph is a triple G = (V, a, 5) where: 

• V is a set of vertices. 

• a : V —> V* is a (total) ordering function. 

• 5 : V — V is a (partial) labelling function such that the length of a(v) is the arity of 
8(y) if 5(v) is defined and is otherwise. 

A labelled graph (V, a, 5) is closed iff 5 is a total function. 

Consider the signature £ = {f, g,h, p}, where arities of f, g, h,p are 2, 1, 0, 2 respec- 
tively, and g, h, p are constructors. Examples of labelled graphs over the signature £ are 
the following ones: 

f K s f 




g h _L g 



\ \ 

J_ J_ 

The symbol _L denotes vertices where the underlying labelling function is undefined (and, 
as a consequence, no edge departs from such vertices). Their role is similar to the one of 
variables in terms. 

If one of the vertices of a labelled graph is selected as the root, we obtain a term graph: 

Definition 4.2 (Term Graph). A term graph, is a quadruple G = (V, a, 5, r), where (V, a, 5) 
is a labelled graph and r G V is the root of the term graph. 

2 As mentioned in the introduction, see |SG M02 for another proof of this with other means. 
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The following are graphic representations of some term graphs. 

fi / 

I 
f 

/\ 

g h 

I 

_L 

The root is the only vertex drawn inside a circle. 

There are some classes of paths which are particularly relevant for our purposes. 

Definition 4.3 (Path). A path vi,...,v n in a labelled graph G = (V,a,S) is said to 
be: 

• A constructor path iff for every 1 < i < n, the symbol S(vi) is a constructor; 

• A pattern path iff for every 1 < i < n, 5(vi) is either a constructor symbol or is undefined; 

• A left path iff n > 1, the symbol 5(vi) is a function symbol and V2, ■ ■ ■ , v n is a pattern 
path. 

Definition 4.4 (Homomorphism). A homomorphism between two labelled graphs G = 
(Vg,o:c,8g) and H = (Vh,cyh,Sh) over the same signature S is a function </? from Vq to 
Vff preserving the term graph structure. In particular 

6 H (<p(v)) = 6 G (v) 

a H (<p(v)) = <p*(a G (v)) 

for any v G dom(S), where is the obvious generalization of if to sequences of vertices. An 
homomorphism between two term graphs G = (Vg, «g, Sq, tq) and H = (Vh , ctjf , <^ff , r^) 
is a homomorphism between (Vcacfc) and (V/j , a^, <5ij) such that ^p{rc) = th- Two 
labelled graphs G and H are isomorphic iff there is a bijective homomorphism from G to 
H; in this case, we write G = H. Similarly for term graphs. 

In the following, we will consider term graphs modulo isomorphism, i.e., G = H iff 
G = H. Observe that two isomorphic term graphs have the same graphical representation. 

Definition 4.5 (Graph Rewrite Rule). A graph rewrite rule over a signature S is a triple 
p = (G, r, s) such that: 

• G is a labelled graph; 

• r, s are vertices of G, called the left root and the right root of p, respectively. 

• Any path starting in r is a left path. 
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The following are examples of graph rewrite rules, assuming a to be a function symbol 
and b, c, d to be constructors: 




h 



P -L JL 




g h 

X 

±_ 

Definition 4.6 (Subgraph). Given a labelled graph G = (Vg,(Xg,o~g) and any vertex v <G 
Vq, the subgraph of G rooted at v, denoted G I v, is the term graph {Vg\.v, &Giv, $Glv, r Glv) 
where 

• Vqiv is the subset of Vq whose elements are vertices which are reachable from v in G. 

• ctGiv and 8giv are the appropriate restrictions of and Sq to Vq^v 

• r Giv is v. 

Definition 4.7 (Redex). Given a labelled graph G, a redex for G is a pair (p,<p), where p 
is a rewrite rule (H,r,s) and <p is a homomorphism between H \,r and G such that for any 
vertex -u G with v ^ dom(5//| r ), any path starting in c^(t>) is a constructor path. 

The last condition in the definition of a redex is needed to capture the call-by-value 
nature of the rewriting process. 

Given a term graph G and a redex ((H, r, s),ip), the result of firing the redex is another 
term graph obtained by successively applying the following three steps to G: 

1. The build phase: create an isomorphic copy of the portion of H ^ s not contained in 
H 4- r, and add it to G, obtaining J. The underlying ordering and labelling functions 
are defined in the natural way. 

2. The redirection phase: all edges in J pointing to <f(r) are replaced by edges pointing to 
the copy of s. If ip(r) is the root of G, then the root of the newly created graph will be 
the newly created copy of s. The graph K is obtained. 

3. The garbage collection phase: all vertices which are not accessible from the root of K 
are removed. The graph / is obtained. 

(H,r,s) 

We will write G — ^-V / (or simply G — >■ /, if this does not cause ambiguity) in this case. 
As an example, consider the term graph G and the rewrite rule p = (H, r, s): 




h ± h 



G 



P 
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There is a homomorphism ip from H \ r to G. In particular, ip maps r to the rightmost 
vertex in G. Applying the build phase and the redirection phase we get J and K as follows: 




Finally, applying the garbage collection phase, we get the result of firing the redex (p, ip): 




Definition 4.8. A constructor graph rewrite system (CGRS) over a signature £ consists 
of a set Q of graph rewrite rules on S. 



4.1. From Term Rewriting to Graph Rewriting. Any term t over a signature £ can 
be turned into a graph [t] in the obvious way: take as [t] the abstract syntax tree of t, where 
vertices are in one-to-one correspondence with symbol occurrences in t. Conversely, any 
term graph G over £ can be turned into a term (G) over S by simply unfolding the graph, 
that is applying (the label of) any vertex to (the terms obtained as unfolding of) its sons 
(remember: we only consider acyclic graphs here). We omit the boring formal definitions 
of both [•] and (•); it is clear that for any term t, ([t]) = t, while in general [(G)] is not equal 
to G, since the sharing present in G is lost during the unfolding. 

Definition 4.9. Given a constructor rewriting system 1Z over S, the corresponding con- 
structor graph rewriting system [1Z] is defined by translating the terms with [•] and by 
translating any term rewrite rule t — > u over £ into a graph rewrite rule (G, r, s) as fol- 
lows: 

• Take the graphs [t] and [u] (which are trees, in fact). 

• From the union of these two trees, share those nodes representing the same variable in 
t and u. This is G. 

• Take r to be the root of t in G and s to be the root of u in G. 

As an example, consider the rewrite rule 

f(g(x),y) ->• g(f(y,f(y,x))). 
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Its translation as a graph rewrite rule is the following: 



_L 




f 



Given a constructor rewriting system 7Z, it is easy to realize that the following invariant 
is preserved while performing rewriting in [JZ]: whenever any vertex v can be reached by two 
distinct paths starting at the root (i.e., v is shared), any path starting at v is a constructor 
path. A term graph satisfying this invariant is said to be constructor- shared. 

Constructor-sharedness holds for term graphs coming from terms and is preserved by 
graph rewriting: 

Lemma 4.10. For every closed term t, [t] is constructor- shared. Moreover, if G is closed 
and constructor- shared and G — >• I in [1Z], then I is constructor- shared. 

Proof. The fact that [t] is constructor-shared for every t follows from the way the [•] map 
is defined: it does not introduce any sharing. Now, suppose G is constructor-shared and 



where (H, r, s) corresponds to a term rewrite rule t — > u. The term graph J obtained 
from G by the build phase is itself constructor-shared: it is obtained from G by adding 
some new nodes, namely an isomorphic copy of the portion of H X s not contained in 
H X r. Notice that J is constructor-shared in a stronger sense: any vertex which can 
be reached from the newly created copy of s by two distinct paths must be a constructor 
path. This is a consequence of {H, r, s) being a graph rewrite rule corresponding to a term 
rewrite rule t —¥ u, where the only shared vertices are those where the labelling function is 
undefined. The redirection phase preserves itself constructor-sharedness, because only one 
pointer is redirected (the vertex is labelled by a function symbol) and the destination of 
this redirection is a vertex (the newly created copy of s) which had no edge incident to it. 
Clearly, the garbage collection phase preserves constructor-sharedness. □ 

Lemma 4.11. A closed term graph G in [JZ] is a normal form iff (G) is a normal form. 

Proof. Clearly, if a closed term graph G is in normal form, then (G) is a term in normal 
form, because each redex in G translates to a redex in (G). On the other hand, if (G) is 
in normal form, then G is in normal form: each redex in (G) translates back to a redex in 



Reduction at the level of graphs correctly simulates reduction at the level of terms, but 
only if the underlying graphs are constructor shared: 

Lemma 4.12. If G is closed and constructor-shared, and G — >■ I in [JZ], then (G) — > (I) 
in 1Z. 

Proof. The fact that each reduction step starting in G can be mimicked in (G) is known 
from the literature. If G is constructor-shared, then the simulation is done in exactly one 
reduction step, because any redex in a constructor-shared term graph cannot be shared. □ 




G. 



□ 
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When G in not constructor-shared, a counterexample can be easily built. Consider the 
term rewrite rule f(h, h) — > h and the following term graph: 




h h 



It corresponds to f (f (h, h), f (h, h)), and it is not constructor-shared, since the shared vertex 
a is not a constructor. It rewrites in one step to 




h 

while the term f (f (h, h), f (h, h)) rewrites to f(h, h) in two steps. 

As can be expected, graph reduction is also complete with respect to term reduction, 
with the only proviso that term graphs must be constructor-shared: 

Lemma 4.13. If t — >■ u in TZ, G is constructor-shared and {G) = t, then G — > I in [TZ], 
where {1} = u. □ 

Theorem 4.14 (Graph Reducibility). For every constructor rewrite system TZ over S and 
for every term t over T,, the following two conditions are equivalent: 

1. t — 7- n u in TZ, where u is in normal form; 

2. [t] — > n G in [TZ], where G is in normal form and (G) = u. 

Proof. Suppose t — > n u, where u is in normal form. Then, by applying Lemma I4.13| we 
obtain a term graph G such that [t] — > n G and (G) = u. By Lemma 14.101 G is constructor- 
shared and, by Lemma l4.11t it is in normal form. Now, suppose [t] — >- n G where (G) = u and 
G is in normal form. By applying n times Lemma 14.121 we obtain that {[t]) — > n (G) = u. 
But ([t]) = t and u is a normal form by Lemma 14.111 since [t] and G are constructor shared 
due to Lemma 14.101 □ 

There are term rewrite systems which are not graph reducible, i.e. for which the two 
conditions of Theorem 14.141 are not equivalent (see [B EG + 86] ). However, any orthogonal 
constructor rewrite system is graph reducible, due to the strict constraints on the shape 
of rewrite rules [Plu90| . This result can be considered as a by-product of our analysis, for 
which graph rewriting is only instrumental. 

4.2. Lambda- Terms Can Be Efficiently Reduced by Graph Rewriting. As a corol- 
lary of Theorems 14. 141 and 12.91 we may reduce A-terms using term graphs. To this purpose, 
we apply the construction of the previous section to the OCRS $ that we defined in Sec- 
tion |1 Let then 9 = [*]: 

Corollary 4.15. Let M € A be a closed \-term. The following two conditions are equiva- 
lent: 

1. M — N where N is in normal form; 

2. [[M]$] — )- n G where (((G))) a = N and G is in normal form. □ 
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Let us now analyze more closely the combinatorics of graph rewriting in 0, so that we 
can obtain information on the efficiency of this simulation. 

• Consider a closed A-term M and a term graph G such that [[M]$] — >* G. By Proposi- 
tion [1] and Lemma 14.121 for every constructor c-^jv appearing as a label of a vertex in 
G, N is a subterm of M. 

• As a consequence, if [[M]$] — >* G — > H, then the difference \H\ — \G\ cannot be too big: 
at most \M\. Therefore, if [[M]$] — > n G then \G\ < (n + 1)|M|. Here, we exploit in an 
essential way the possibility of sharing constructors. 

• Whenever [[M]$] — G, computing a graph H such that G —> H takes polynomial time 
in which is itself polynomially bounded by n and \M\. 

Hence (recall that Time v {M) is the number of weak call- by- value beta steps to normal 
form) : 

Theorem 4.16. There is a polynomial p : N 2 — > N such that for every X-term M, the 
normal form of [[M]$] can be computed in time at most p(\M\, Time v (M)). O 

This cannot be achieved when using explicit representations of A-terms. Moreover, 
reading back a A-term from a term graph can take exponential time. 

We can complement Theorem 14.161 with a completeness statement — any universal 
computational model with an invariant cost model can be embedded in the A-calculus with 
a polynomial overhead. We can exploit for this the analogous result we proved in [DLM08J 
(Section 4, Theorem 1) — the unitary cost model is easily proved to be more parsimonious 
than the difference cost model considered in [DLM08]. 

Theorem 4.17. Let f : X* — > X* be computed by a Turing machine Ai in time g. Then, 
there are a X-term N_m and a suitable encoding r - n : X* — > A such that Nj^fv^ normalizes 
to r f(v)~ l in 0(g(\v\)) beta steps. □ 

The encoding r - n mentioned in the theorem depends only on (the cardinality of) X (but 
not on the Turing machine) . Interestingly enough it exploits once again the scheme that we 
used in Definition 13. II encode the empty string e as a zero-ary constructor, and any symbol 
in X as a unary constructor (see [DLM08] for details). 

5. Variations: Call-by-Name Reduction 

In the previous sections, A-calculus was endowed with weak call-by-value reduction. The 
same technique, however, can be applied to weak call-by-name reduction, as we will sketch 
in this section. A is now endowed with a relation — >h defined as follows: 

M —>h N 

(Xx.M)N -> h M{N/x} ML NL 

Similarly to the call- by- value case, Timeh(M) stands for the number of reduction steps to 
the normal form of M (if any). Since the relation — is deterministic (i.e., functional), 
Timeh(M) is well-defined. 

We need another OCRS, called ^, which is similar to $ but designed to simulate weak 
call-by-name reduction: 

• The signature X^, includes the binary function symbol app and constructor symbols c x ,M 
for every M € A and every x E T, exactly as X$. Moreover, there is another binary 
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constructor symbol capp. To every term M G A we can associate terms {M}y, [M]^ 6 
V(*,T) as follows: 

{x}\& = X 

{Xx.M}^, = c x ^m(xi, . . . , x n ), where FV(Ax.M) = x±, x n 

{MN}* = capp({M}^,{N} 9 ) 

[x]-qj = X 

[Xx.M] 

* — c x,m(x\, . . . , x n ), where FV( Ax. M) — x\, x n 
[MN]y = app([M]*,{A^) 

Notice that {-}^ maps A-terms to constructor terms, while terms obtained via [■]$ can 
contain function symbols. The "official" translation of a term M is thus [M]#, where 
only the applications "on the spine" of M are encoded with app. All other applications 
are frozen by the constructor capp. 
• The rewrite rules in IZq? are all the rules in the following form: 

app(c 2i2 ,capp(u;,/)) -»• app(w,f) 

app(c^,c Xj Af(a;i, • • • ,x n )) ->■ c Xj m(xi, . . . ,x n ) 

app(c ZtW (capp( f,g)),h) -»• app(/,^) 

app(c ZjW (c Xj M(a;i, • • • ,x n )),/i) -)• c Xi m(xi, . . . ,x n ) 

app^Ar^i,...,^),?/) -> [A^]* 

where M ranges over A-terms, A^ ranges over abstractions and applications, FV(Ax.M) = 
x±, . . . ,x n and FY(Xy.N) = yi, . . . , y m . These rewrite rules are said to be ordinary rules. 
We also need the following administrative rule: 

app(capp(x, y), z) -> app(app(x, y), z). 

The CTRS ^ is more complicated than because we need to force reduction to happen only 
in head position. The applications app (on the spine) may be fired immediately. Observe, 
however, that the main rewriting rule (the last of the ordinary ones) is restricted to those 
c Vt N where A^ is not a variable. When A^ is a single variable, the corresponding beta redex 
would be either (Xx.x)L or (Xz.w)L, with w free. In the former case, an application at the 
top level of L (encoded as a capp at this point) would become the top level application of 
the spine of the reduct: the first ordinary reduction rule handles this case, unfreezing capp 
into app. When, on the other hand, the encoded redex is (Xz.w)L, we do not need to worry 
for L, which will be discarded, but in the term-reduction we must take care of the eventual 
substitution that may occur for w: the term substituted for w may have a top level frozen 
application capp that must be converted into an app — this is the role of the third ordinary 
reduction rule. The second and fourth reduction rules just handle the remaining cases (they 
would be instances of the last ordinary rule if this was not restricted to the non-variable 
cases). A last remark on the administrative rule. There are never administrative redexes in 
the translation [M]^ of a term. During reduction, however, by the effect of the other rules 
a frozen application (a capp) may appear on the spine. The administrative rule recognizes 
this situation and unfreezes the application. 
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As usual, to every term t G V(^, T) we can associate a term (t)\: 

(x) A = x 
(app(u, v))a = (capp(u,v))\ = (u)a(v)a 

{c x ,m(H, . . .t n ))A = (Ax.M){(ti) A /xi, . . . , {t n )A/x n } 

where FV(Ax.M) = x±, . . . ,x n . A term t G T(^f) is canonical if either t = (ii • • • , t n ) G 
C(^f) or t = app(tt, v) where u is canonical and v G C(*&). 

Lemma 5.1. For every closed M G A, [M]q, is canonical. 

Proof. By a straightforward induction on M. □ 
The obvious variation on Equation (|2.1|) holds here: 

([M]*{ti/Xi, . . • , W^})a = M {(ti) A /x ll . . . , (t„) A /x n }. (5.1) 

\& mimics call-by-name reduction in much the same way $ mimics call-by-value reduction. 
However, one reduction step in the A-calculus corresponds to n > 1 steps in ^, although n 
is kept under control: 

Lemma 5.2. Suppose that t G T(^) is canonical and that t — > -u. T/ien i/iere is a natural 
number n such that: 

i- (*) A ^ («>a; 

2. There is a canonical term v G T(^) smc/i that u — > n v; 
3- Mapp = l n lapp + m whenever u — > m w and m < n; 
4. (w)a = {u)a whenever u — > m w and m < n. 

Proof. A term t is said to be semi- canonical iff t = app(n, v), where v G C(^) and u is either 
semi-canonical or is itself an element of C(*&). We now prove that if t is semi-canonical, 
there there are a natural number n and a canonical term u such that: 

• t u; 

• Mapp = Napp + fn whenever t — > m v and m < n; 

• { v )a = (*)a whenever t — > m v and m < n. 

We proceed by induction on \t\. By definition t is in the form &pp(w,d); we have three 
cases: 

• w is semi-canonical. Then, we get what we want by induction hypothesis. 

• w is in C( V I / ) and has the form c X) M{t\, ■ ■ ■ ,t m ). Then, n = and t is itself canonical. 

• w is in C( V I / ) and has the form capp(e, /). Then 

t = app(capp(e, f),d) ->• app(app(e, /), d). 
Apply now the induction hypothesis to app(e, /) (since its length is strictly smaller than 
1*1)- 

We can now proceed as in Lemma [2. 7\ since whenever t rewrites to u by one of the ordinary 
rules, u is semi-canonical. □ 

Lemma 5.3. A canonical term t G T{^) is in normal form iff if) a is in normal form. 

Proof. We first prove that any canonical normal form t can be written as c x ,M{ti, ■ ■ ■ ,t n ), 
where ti,...,t n G C(^). We proceed by induction on t: 

• If t = c X) M{ti, ■ ■ ■ , t n ), then the thesis holds. 
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• If t = app(u, v), then u is canonical and in normal form, hence in the form c x ,M(tl, ■ ■ ■ , t n ) 
by induction hypothesis. As a consequence, t is not a normal form, which is a contra- 
diction. 

We can now prove the statement of the lemma, by distinguishing two cases: 

• If t = c x ^M(ti, ■ ■ ■ ,t n ), where t\,. . . ,t n € C(^), then t is in normal form and (t)\ is an 
abstraction, hence a normal form. 

• If t = app(it, v), then t cannot be a normal form, since u is canonical and in normal 
form and, as a consequence, it can be written as c^m^i, • • • ,t n ). 

This concludes the proof. □ 

Observe that this property holds only if t is canonical: a non-canonical term can reduce 
to another one (canonical or not) even if the underlying A-term is a normal form. 

Lemma 5.4. If M N, t is canonical and {t)\ = M, then t — >• u, where (u)a = N and 

1^*1 app ~i~ 1 ^ |^|app- 

Proof. Analogous to the one of Lemma [2. 8 1 for the first part of the statement. For the bound 
on the number of app, argument similarly to the proof of Lemma 15.21 □ 

The slight mismatch between call-by-name reduction in A and reduction in \& is anyway 
harmless globally. As we now show, the total number of reduction steps in \& is at most 
two times as large as the total number of call-by-name reduction steps in A. 

Theorem 5.5 (Term Reducibility) . Let M 6 A be a closed term. The following two 
conditions are equivalent: 

1. M — >V; N where N is in normal form; 

2. [M]y — > m t where (t)\ = N and t is in normal form. 
Moreover n < m < In. 

Proof. Suppose M — >^ N, where N is in normal form. M is closed and, by Lemma 15.11 
[M] <i< is canonical. By iterating over Lemma 15.21 and Lemma 15.41 we obtain the existence 
of a term t such that (t}\ = u, t is in normal form and [M]^ —> m t, where m > n and 

|*|a PP - I [M] 4- 1 a pp > (m - n) - n. 

Since |£| app = (t is in normal form), m < 2n. If [M]-$ — > m t where (t)\ = N and t is in 
normal form, then by iterating over Lemma f5.2l we obtain that M — >^ N where n < m < 2n 
and TV is in normal form. □ 

H is the graph rewrite system corresponding to ^, in the sense of Section HI Exactly as 
for the call-by-value case, computing the normal form of (the graph representation of) any 
term takes time polynomial in the number of reduction steps to normal form: 

Theorem 5.6. There is a polynomial p : N 2 — > N such that for every X-term M, the normal 
form of [[M]-$]s can be computed in time at most p(\M\, Timeh(M)). 

On the other hand, we cannot hope to directly reuse the results in Section [3] when 
proving the existence of an embedding of OCRSs into weak call-by-name A-calculus: the 
same A-term can have distinct normal forms in the two cases. It is widely known, however, 
that a continuation-passing translation can be used to simulate call-by-value reduction by 
call-by-name reduction |Plo75| . The only missing tale is about the relative performances: 
do terms obtained via the CPS translation reduce (in call-by-name) to their normal forms 
in a number of steps which is comparable to the number of (call-by-value) steps to normal 
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form for the original terms? We conjecture the answer is "y es '\ but we leave the task of 
proving that to a future work. 

6. Conclusions 

We have shown that the most naive cost models for weak call-by-value and call-by-name 
A-calculus (each beta-reduction step has unitary cost) and orthogonal constructor term 
rewriting (each rule application has unitary cost) are linearly related. Since, in turn, this 
cost model for A-calculus is polynomially related to the actual cost of reducing a A-term on 
a Turing machine, the two machine models we considered are both reasonable machines, 
when endowed with their natural, intrinsic cost models (see also Gurevich's opus on Abstract 
State Machine simulation "at the same level of abstraction", e.g. |Gur01| ). This strong (the 
embeddings we consider are compositional), complexity-preserving equivalence between a 
first-order and a higher-order model is the most important technical result of the paper. 

Ongoing and future work includes the investigation of how much of this simulation 
could be recovered either in a typed setting (see |SU99| for some of the difficulties), or in 
the case of A-calculus with strong reduction, where we reduce under an abstraction. Novel 
techniques have to be developed, since the analysis of the present paper cannot be easily 
extended to these cases. 
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